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(57) Abstract: The Video Motion Anomaly Detector addresses the problem of automatically detecting events of interest to operators 
O of CCTV systems used in security, transport and other applications, processing CCTV images. The detector may be used in a number 

of ways, for example to raise an alarm, sunmioning a human operator to view video data, or to trigger selective recording of video 
1^ data or to insert an index mark in recordings of video data. 
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VIDEO MOTION ANOM ALY DETECTOR 

The present invention relates to devices and naethods for processing video images to 
raise an alarm signal when an event of interest is detected. 

5 

Qosed circuit television (CCTV) is widely used for security, transport and other 
purposes. Example applications include the observation of crime or vandalism in 
public open spaces or buildings (such as hospitals and schools), intrusion into 
prohibited areas, monitoring the free flow of road traffic, detection of traffic incidents 
10 and queues, detection of vehicles travelling the wrong way on one-way roads. 

The monitoring of CCTV displays (by human operators) is a very laborious task 
however and there is considerable risk that events of interest may go unnoticed. This is 
especially true when operators are required to monitor a number of CCTV camera 

15 outputs simultaneously. As a result in many. CCTV installations, video data is recorded 
and only inspected in detail if an event is known to have taken place. Even in these 
cases, the volume of recorded data may be large and the manual inspection of the data 
may be laborious. Consequently there is a requirement for automatic devices to 
process video images and raise an alarm signal when there is an event of interest. The 

20 alarm signal can be used either to draw the event to the immediate attention of an 
operator, to place an index mark in recorded video or to trigger selective recording of 
CCTV data. 

Some automatic event detectors have been developed for CCTV systems, though few 
25 of these are very successful. The most common devices are called video motion 
detectors (VMDs) or activity detectors, though they are generally based on simple 
algorithms concerning the detection of changes in the brightness of the video image - 
not the actual movement of imaged objects. For the purposes of detecting changes in 
brightness, the video image is generally divided into a grid of typically 16 blocks 
30 horizontally and vertically (i.e. 256 blocks in total). There several disadvantages of 
these algorithms. For example, they are prone to false alarms, for example when there 
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are changes to the overall levels of illumination. Furthermore, they are imable to detect 
the movement of small objects, because of the block-based processing. In addition, 
they cannot be applied if the scene normally contains moving objects which are not of 
interest. These disadvantages can be reduced to a lin[iited extent by additional 
5 processing logic, but the effectiveness of standard VMDs is inherently limited by the 
use of change detection as the initial image-processing stage. 

There is another type of detection device, which is characterised by the use of complex 
algorithms involving image segmentation, object recognition and tracking and alarm 
10 decision rules. Though these devices can be very effective, they are generally 
expensive systems designed for use in specific applications and do not perform well 
without careful tuning and setting-up, and may not work at all outside of a limited 
range of applications for which they were originally developed. 

15 US Patent No. 6,081,606 by inventors Wade & Jeffrey describes an apparatus and a 
method for detecting motion within an image sequence. That document discloses that 
motion within an image may be calculated by correlating areas of one image with areas 
of the next image in the video to generate a flow field. The flow field is then analysed 
and an alarm raised dependent on the observed magnitude and direction of flow. 

20 

European Patent No 0 986 912 describes a method for monitoring a predetermined 
surveillance region. A video image is divided into a number of segments. A statistical 
distribution for the mean grey level is determined for each segment. A change in mean 
grey level for a segment, outside the usual statistical variation, may be used to trigger 
25 an alarm. 

The Video Motion Anomaly Detector addresses the problem of automatically detecting 
events of interest to operators of CCTV systems used in security, transport and other 
applications, processing CCTV images. The detector may be used in a number of 
30 ways, for example to raise an alarm, summoning a human operator to view video data. 
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or to trigger selective recording of video data or to insert an index mark in recordings 
of video data. 

Accordingly, the present invention provides a method for processing video images to 
5 detect an event of interest, comprising the steps of: receiving a video signal 
representing the video images to be processed; extracting at least one point feature 
from the video signal; tracking the position and movement of the at least one point 
feature within the video images to generate a corresponding at least one track, each . 
representing a corresponding point feature; using an iterative learning process to derive 
10 a normal pattern of behaviour for each track; comparing present behaviour of the at 
least one track to the respective normal pattern of behaviour; and in response to the 
present behaiviour falling outside the normal pattern of behaviour, generating an alarm 
signal . 

15 The alarm signal may cause at least one of the following effects: draw the attention of 
an operator, place an index mark at the appropriate place in recorded video data; and 
trigger selective recording of video data. 

The learning process may accumulate data representing the behaviour of the track(s) 
20 over a period of time in a four-dimensional histogram, said four dimensions 
representing x-position, y-position, x-velocity and y-velocity, of the track(s) within the 
video image. Furthermore, the learn behaviour stage may segregate the tracks 
according to a velocity threshold; wherein tracks moving at a velocity below the 
velocity threshold are considered stationary while tracks moving at a velocity in excess 
25 of the velocity threshold are considered mobile; wherein data concerning the mobile 
tracks is stored in said four-dimensional histogram, data concerning the stationary 
tracks being stored in a two-dimensional histogram, said two dimensions representing 
x-position and y-position within the video image. Furthermore, a cell size of the four- 
dimensional histogram may vary in accordance with a measured speed in the image of 
30 each respective track. The histogram may be periodically de-weighted in order to bias 
the result of the learning process towards more recent events. 
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The comparison process may classify a track according to a comparison of the 
frequency of occupation of the corresponding histogram cell with an occupancy 
threshold. The comparison process may act to classijty as normal behaviour a track 
5 adjacent or near a cell which is above the occupancy threshold, despite the track 
appearing in a cell below the occupancy threshold, where one cell is considered to be 
near another if the distance between them is below a predetermined distance threshold. 

Abnoraial tracks may be filtered, whereby an active alarm signal is generated in 
10 response to an abnormal track which resembles a number of other abnormal tracks, in 
terms of at least one of position, velocity and time. 

Abnormal tracks may be filtered, whereby an active alarm signal is generated in 
response only to an abnormal track which has been classified as abnormal on a 
15 predetermined number of occasions. 

Abnormal tracks may be filtered, whereby an active alarm signal is generated in 
response only to a track being classified as abnormal for the first time. 

20 Abnoraial tracks may be filtered, whereby an active alarm signal is generated only in 
response to a filte?red version of the classification rising above a predetermined 
threshold value. 

Subsequent active alarm signals may be inhibited for a predetermined time interval 
25 after a first active alarm signal has been produced. 

Subsequent active alarm signals may be inhibited if caused by an abnormal track within 
a predetermined distance of another track which has previously generated an alarm. 



30 The present invention also provides apparatus for processing Video images to detect an 
event of interest, comprising: a source of video images, producing a video signal 
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representing the video images to be processed; a feature extraction device receiving 
the video signal and producing data representing at least one point feature detected 
within the image; a feature tracking device receiving the data representing point 
features and producing data representing tracks, being representative of the position 
5 and speed of each respective point feature, within the image; a learning device 
receiving the data representing the tracks and producing a signal representing a range 
of behaviour considered normal by the learning device, in response to operation of a 
learning process on the data representing the tracks; a classification device receiving 
both the signal representing the normal range of behaviour of the tracks and the data 
10 lepresenting the tracks, being adapted to compare the signal and the data and to issue a 
normal/abnormal signal in accordance with the outcome of such comparison; and an 
alarm generation device receiving the normal/abnormal signal and generating at least 
one active alarm signal in response to the normal/abnormal signal indicating abnormal 
behaviour of at least one track. 

15 

The above, and further, objects, characteristics and advantages of the present invention 
will become more apparent from the following description of certain embodiments 
thereof, with reference the Fig. 1 which shows schematic block diagram of a video 
motion anomaly detector according to the present invention. 

20 

The present invention provides a video motion anomaly detector that is feature-based, 
and generates alarms on the basis of abnormal behaviour of a track representing the 
behaviour of a point feature. The known systems do not have these characteristics. 

25 

The video motion anomaly detector extracts and tracks point-like features in video 
images and raises an alarm when one or more track(s) each representing a feature is/are 
behaving abnormally compared with the behaviour of those tracks observed over a 
period of time. By '^behaviour" we mean the movement of tracks in different parts of 
30 the video image. For example, rapid movement of features in a particular direction in 
one part of the field of view may be normal, but it may be abnormal if it occurred in 



wo 2004/017273 



PCT/GB2003/002963 



-6- 

another part of the field of view where the normal behaviour is slow movement. 
Similarly, rapid movement in the same part of the field of view may be abnormal if the 
movement is in a different direction. 

5 Fig. 1 shows the main processing stages in the video motion anomaly detector. 

An incoming video signal 10 is provided by a source of video images, for example a 
CCTV camera. The video signal 10 represents video images generated by the source of 
video images. Typically, the video images will show a view of an area surveyed by a 
10 CCTV security camera. 

The video signal 10 is supplied to a feature extraction stage 1. The feature extraction 
stage 1 locates point-like features in each processed image in the video image 
sequence. A suitable feature extraction process is described in Patent No: GB22 18507, 
15 "Digital Data Processing". The located features are identified in a signal 12 provided 
to the next stage, feature tracking stage 2. 

The feature tracking stage 2 tracks features so that each point-like feature can be 
described by its current position and its estimated velocity in the image. The feature 
20 tracking stage 2 provides a track, that is a signal 14 indicating the current position and 
estimated velocity in the image, for each tracked feature, to a leam behaviour stage 3 
and to a track classification stage 4. 

The leam behaviour stage 3 accumulates information about the behaviour of features 
25 over a period of time. One way of doing this is to accumulate a four-dimensional 
histogram, the four dimensions of the histogram being x-position, y-position, x- 
velocity, y-velocity. 

The track classification stage 4 classifies each track as being 'normal' or ^abnormal', as 
30 compared with the behaviour information accumulated by the leam behaviour stage 3. 
One way of classifying a track is to compare the firequency of occupancy of the 
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corresponding histogram cell with a threshold. If the frequency of occupancy is below 
the threshold, the track is classified as abnormal, otherwise it is considered normal. 
The track classification stage 4 sends a normal/abnormal signal 18 to an alarm 
generation stage 5. If any of the tracked features show abnormal behaviour, then this 
5 signal will inform the alarm generation stage 5 of the abnormality. 

The alarm generation stage 5 generates an alarm signal 20 in response to an active 
normal/abnormal signal 18, indicating that at least one abnormal track has been found 
to be present. Additional processing logic may be provided to resolve situations such 
10 as intermittent abnormal behaviour or multiple instances of abnormal behaviour 
associated with one real-world event, and other such situations. 

The video motion anomaly detector of the present invention preferably provides at least 
one of the following features: the use of point feature extraction and tracking in an 
15 event detector; and the detection of events by classification of track behaviour as being 
abnormal, compared with the behaviour of tracks observed over time. 

Compared with event detection based on a known video motion detection, the video 

motion anomaly detector of the present invention provides at least some of the 
20 following advantages. 

It is based on point-feature extraction rather than detecting changes in image 

brightness. This makes the system of the invention relatively insensitive to changes in 

scene illumination levels. Scene illumination levels are major source of false alarms in 

known video motion detectors. 
25 - Because it is based on the detection of point features rather than block processing, the 

system of the invention is able to detect the movement of even relatively small objects, 

and to indicate an alarm condition if the movement is unusual. 

- The system of the invention accumulates information about the behaviour of each 
tracked feature, enabling it to build up a definition of ^normal' behaviour for each 
30 feature. The system of the invention can then detect movement of interest, that is. 
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movement departing from 'normal* behaviour for that particular feature, even in the 
presence of other features moving "normally*. 

- The system of the invention detects abnormal behaviour, that is, behaviour departing 
from the calculated "normal* behaviour for a particular feature, rather than pre-defined 
5 specific behaviour. The invention may accordingly be applied to a very wide range of 
different applications with little special setting-up, and can adapt to long-term or 
permanent changes in the viewed image. 

Compared with existing event detection systems based on complex software solutions, 
10 the video motion anomaly detector of the present invention is a relatively simple 
system suitable for implementation in relatively inexpensive hardware. 

The main processing stages of the video motion anomaly detector are now described in 
more detail, with reference to Fig. 1. 

15 

Fig. 1 schematically shows the main processing stages in a video motion anomaly 
detector of the present invention, receiving video information 10, and producing an 
alarm signal 20. 

20 The feature extraction stage 1 locates point-like features in each processed image in the 
video image sequence. A preferred feature extractor is described in Patent No: 
GB2218507, "Digital Data Processing" and this is further described by Harris and 
Stephens in the Proceedings of the 4th Alvey Vision Conference, September 1988, 
University of Manchester, "A combined comer and edge detector". An important 

25 aspect of this feature extractor is that it provides feature attributes, i.e. quantitative 
descriptions of the extracted features in addition to their position in the image. 
Compared with other point-feature extraction algorithms the Harris method is 
particularly robust. 

30 Other point-feature extraction techniques have been developed, and could be used 
within the present invention. For example, one such technique is described by 
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Moravec in Tech Report CIVIU-RI-TR-3, Caniegie-Mellon University. Robotics 
Institute, Sep 1980 "Obstacle avoidance and navigation in the real world by a seeing 
robot rover". Other ad hoc schemes can be envisaged. 

5 Indeed, any algorithm that can be used to extract image features that can be associated 
with a locality can be used as a point-like feature extractor in the present invention. As 
examples, knot-points, that is to say points of high curvature on edge features, can be 
assigned a position. A feature such as an image region, for example an entire vehicle or 
person, segmented by edge-finding or region growing techniques, can be assigned the 

10 position of its centroid. 

The feature extractor 1 may employ a fixed feature-strength threshold, in which case 
the number of extracted features may vary from frame to frame and it may vary from 
one application to another. Altematively, the threshold may be varied so that a fixed 
15 number of features are extracted. 

The feature tracking stage 2 tracks features between image frames so that each point- 
like feature can be described by its current position and its estimated velocity in the 
image. 

20 

In the present application a multi-target tracking algorithm is required as a scene may 
contain a large number of moving objects and each may give rise to a number of 
extracted features. For example, a car passing through the field of view may generate a 
number of extracted features on each video frame, depending on the spatial resolution 
25 used, and a traffic scene may contain a number of cars moving in different parts of the 
field of view. Tracking algorithms are themselves relatively well-known and 
understood. A treatise on the subject is given by Blackman & Popoli in *T)esign & 
Analysis of Modem Tracking Systems", Artech House 1999. 



30 Tracking is a cyclic process. At any time, a number of objects may be being tracked. 
Their current position and velocity are known and as each new firame of video data is 
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presented, the track information needs to be updated. Such a tracking algorithm 
typically consists of the following stages. 

- Plot (feature)-to-track association: this is the process of deciding which of the 
5 features, extracted from the most recent video frame, is the same object as that 
represented by any particular track. As "plot" rather than "feature'* is the normal term 
used in discussion of tracking algorithms, "plot" will be used in this section of the 
description. A standard approach is to only consider for association plots that fall 
within a window or gate, which is centred on the plot's predicted position (see below). 

10 Many tracking algorithms then employ simple rules to handle situations where more 
than one plot falls in the acceptance gate. Example rales include 'associate the plot 
nearest to the predicted position*, or 'associate neither*. These options will be 
discussed below. la the present application the density of plots is typically high and the 
possibility of plot-to-track association error is high so the preferred approach makes use 

15 of the similarity of plot (feature) attributes as well as plot position in making the plot- 
to-track association decision. Other schemes for resolving ambiguities are possible, for 
example probabilistic matching, multiple-hypothesis tracking and deferred decision 
making. Other refinements may be employed to improve performance, for example 
cross-correlation of image patches to confirm the similarity of imaged features, or 

20 variation of the acceptance window size according to the expected accuracy of 
prediction. This latter option will allow well established and slow moving tracks to 
have a smaller acceptance window than fast moving or newly formed tracks. Bi- 
directional matching schemes are also possible. 

25 - Track Maintenance: this is the process of initiating new tracks, typically because a 
new object has come into view, and deleting tracks, typically because a tracked feature 
is no longer in view. Most tracking algorithms will also have a track confirmation 
process for deciding whether a track is sufficiently well-established as to be the basis 
for subsequent decision making. In a preferred implementation, new tracks are 

30 initiated from plots that cannot be accounted for after plot-to-track association with 
existing tracks. Because of uncertainties in the performance of the feature extractor, 
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tracks are not immediately deleted in the preferred implementation if they are not 
associated with any plot. Instead, a track may "coast" for a number of frames before 
deletion. Tracks are confirmed once they have been successfully associated with plots 
on a nimiber of frames. 

5 

Tracking feature of low feature strength is problematic, because they are not reliably 
detected on each video frame. Tracking errors become more common when the tracks 
are closely spaced. One way of reducing this problem is to ignore unmatched plots of a 
low feature-strength, and only initiate new tracks for unmatched plots of a higher 
10 feature-strength. 

- Track filtering and prediction: this is the process of estimating the current plot 
position and velocity, and predicting the expected plot position on subsequent image 
frames. This process is required because measurements of feature positions may be 

15 imperfect because of pixel quantisation and image noise, and objects may move 
appreciably between image frames. A number of methods are applicable, for example 
recursive Kalman or Alpha-Beta filtering, fitting polynomial spUnes to recent plot data . 
etc. Performance here may be improved by a number of schemes, for example outlier 
removal -and varying the order of a polynomial spline according the length of a track's 

20 history. 

In general, tracking algorithms follow a common pattern though individual 
implementations may vary in detail according to application-specific factors and other 
issues. 

25 

The leam behaviour stage 3 accumulates information about the behaviour of tracks, 
representing tracked features, over a period of time. The preferred way of doing this is 
to accumulate a four-dimensional histogram, the four dimensions of the histogram 
30 being x-position, y-position, x-velocity, y-velocity. Alternatively, other co-ordinate 
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systems might be used such as polar co-ordinates or Hough transform space co- 
ordinates, etc. 

The behaviour histogram, being four-dimensional, may require a large amount of data 
5 storage and this may be impractical for implementation in processing hardware. The 
preferred way of overcoming this problem is to partition the histogram into two 
sections. The first is for stationary tracks, i.e. very slow moving ones. This section is a 
two-dimensional histogram and so may use a fine cell size without requiring large 
amounts of storage. The second histogram section is for moving objects of different 
10 speeds. Although this is a four-dimensional histogram, a coarser cell size may be used 
for such objects so the size of this section of the histogram also need not be large. The 
size of the second partition may also be reduced by using a cell size which varies with 
speed. 

15 There are a number of alternative ways of constructing the histogram, to minimise 
memory requirements. These include quad tree and other sparse matrix representation 
techniques. 

The histogram describing behaviour may be built up over a fixed period of time and 
20 remain unchanged thereafter. This has the disadvantage that slow changes in actual 
behaviour, drift in camera electronics or sagging of camera mounts etc, may ultimately 
result in normal behaviour being classified as abnormal. This may be overcome in a 
number of ways. In the preferred method, the histogram is periodically de-weighted by 
a factor close to unity, with the result that the system has a fading memory, i.e. its 
25 memory is biased towards most recent events. Depending on processor limitations, it 
may be necessary to implement the fading memory in ways to reduce processor load. 
For example, the de-weighting might be applied at a larger interval, or only a part of 
the histogram might be de-weighted at shorter intervals. Another way of biasing 
memory of behaviour to recent events, albeit not a fading memory, is to use two 
30 histogram stores, one being built up while the other is being used for track 
classification: 
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While histogramming is the preferred approach, there are other possible learning and 
classification methods, and some of these are discussed below. 

5 

The track classification stage 4 classifies each track as being 'normal* or 'abnormal' 
and the method used depends on how behaviour is being leamt in the leam behaviour 
stage 3. In the preferred histogram-based method, a track is classified by comparing 
the frequency of occupancy of the corresponding histogram cell with an occupancy 
10 threshold. Jf the frequency of occupancy is below the threshold, the track is classified 
as abnormal, otherwise it is considered normal. 

If a very low false alarm rate is required, it may be necessary to take additional steps to 
prevent an adverse system response, particularly if the training time has been limited. 

15 As an example of such steps, a track may be classified as normal, even if the occupancy 
of the corresponding histogram cell is low, if it is adjacent or near an above threshold 
cell. The distance ftom the corresponding histogram cell to the nearest above 
occupancy threshold cell (measured within the histogram by city-block, Cartesian or 
some other metric which may be occupancy weighted) may be compared with a 

20 distance threshold. If the distance is less than the threshold, the track is classified as 
normal, otherwise the track is considered abnormal. The false alarm rate can be 
adjusted by varying the occupancy threshold and the distance threshold. As an 
alternative to the use of a distance threshold, histogram data might be blurred to 
suppress the classification of tracks as abnormal in cells close to high occupancy cells. 

25 Similarly, other morphological operators might be applied. 

The alarm generation stage 5 generates an alarm signal 20 when abnormal tracks are 
found to be present, subject to additional processing logic to resolve situations such as 
30 intermittent abnormal behaviour or multiple instances of abnormal behaviour 
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associated with one real-world event, and other situations such as when spurious track 
data is generated by a tracking error. 

The risk of alarms being generated by spxirious data can be reduced by limiting 
5 processing to confirmed tracks or tracks whose track history shows a consistent history 
of associated plots. Alarms generated by spurious data can also be reduced by limiting 
processing to either: abnormal tracks which occur in close proximity (in terms of any 
of position, velocity or time) to a number of other abnormal tracks, and/or: tracks 
which are classified as abnormal on a number of occasions. 

10 

To prevent intermittent alamas, an alarm signal 20 can.be raised (subject to other logic) 
only when a track is classified as abnormal for the first time, or when a filtered version 
of the classification rises above a threshold. 

15 Multiple alarms might be generated, for example, if a vehicle viewed by the CCTV 
system takes an abnormal path and generates a number of separate abnormal tracks in 
the process - the different tracks being generated by different parts of the vehicle. 
These multiple alarms would be confusing and unwanted in a practical system. These 
can be suppressed by inhibiting alarms for a period of seconds after a first alarm. 

20 Alternatively, alarms could be suppressed if the track causing the alarm is within some 
distance of a track that has previously generated an alarm. 

A number of different methods of alarm generation logic can be envisaged, with 
different ad hoc formulations. 

25 

The above sections describe preferred methods for use within the present invention, 
although alternative learning and classification methods may be used. A selection of 
such alternatives described below. 

30 
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Fan-in/Fan-out (MLP) Neural Net. In outline, the idea is to use a multi-layer perceptron 
with 4 input nodes (for track dataX,Y, Vx, Vy) and 4 output nodes, but an intermediate 
layer of 2 or 3 nodes. The perceptron is a program that learn concepts, i.e. it can leam 
to respond with True (1) or False (0) for inputs we present to it, by repeatedly 

5 "studying" examples presented to it. The Perceptron is a neural network whose weights 
and biases may be trained to produce a correct target vector when presented with the 
corresponding input vector. The training technique used is called the perceptron 
learning rale. The perceptron is able to generalise from its training vectors and work 
with randomly distributed connections. Perceptrons are especially suited for simple 

10 problems in pattern classification. The network would be trained to reconstract its own 
input. Because of the internal constriction, the reconstruction should be better for 
"normal" tracks and worse for "abnormal" tracks. Thus, the accuracy of reconstruction 
could be used to assess the normality of the track. 

15 Nearest Neighbour. Here, tracked feature data for recent frames is retained to create a 
track history database. Each new track is tested for normality by searching this 
database to find any similar previous tracks. 

Praned Nearest Neighbour. This is a variation of the full nearest neighbour technique. 
20 The history database is reduced in size by omitting duplicates or near duplicates of 
earlier data. 

Kohonen Net. Although this is a neural net technique, this can be viewed as similar to 
the praned nearest neighbour method. The behaviour of tracked objects is described by 
25 a set of nodes positioned in the four-dimensional input space. The actual positions of 
the nodes are determined by an iterative training process. This method is also related to 
adaptive code-book generation methods used in data compression systems. 

Probabilistic Checking. This is a lateral approach to searching history databases for the 
30 nearest neighbour-based algorithms. Here, the history database is searched by 
choosing for comparison entries in a random sequence until a number of matches are 
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found. If the track being checked is very normal, a number of matches will be found 
very quickly. 



5 Accordingly, the present invention provides a video motion anomaly detector which 
addresses the problem of automatically detecting events of interest to operators of 
CCTV systems used in security, transport and other applications, by processing CCTV 
images. The detector may be used, for example, to activate an alarm to sunmion a 
human operator to view video data, to trigger selective recording of video data or to 

10 insert an index mark in recordings of video data. The video motion anomaly detector 
of the present invention extracts and tracks point-like features in video images and 
raises an alarm when a feature is behaving abnormally, compared with the *normar 
behaviour of those features, derived jfrom observations of that feature over a period of 
time. 

15 

Existing video motion detectors are devices which are essentially based on detecting 
changes in image brightness averaged over image sub-blocks. The video motion 
anomaly detector of the present invention has the advantage of being less prone to false 
alarms caused by changes in scene illumination levels. The detector of the present 
20 invention can also detect the movement of smaller objects and detect movements of 
interest, even in the presence of other moving objects. Further, it can be applied to a 
very wide range of different applications with little special setting. Compared with 
other existing event detection systems based on complex software solutions, the video 
motipn anomaly detector can be implemented in relatively inexpensive hardware. 
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Claims 

1 A method for processing video images to detect an event of interest, comprising the 
steps of : 

receiving a video signal (10) representing the video images to be processed; 

5 - extracting (1) at least one point feature from the video signal ; 

tracking (2) the position and movement of the at least one point feature 
within the video images to generate a corresponding at least one track, each 
representing a corresponding point feature; 

- using (3) an iterative learning process to derive a normal pattem of 
IQ behaviour for each track; 

comparing (4) present behaviour of the at least one track to the respective 
normal pattern of behaviour; and 

in response to the present behaviour falling outside the normal pattem of 
behaviour, generating (5) an alarm signal (20). 

15 2. A method according to claim 1 wherein the alarm signal (20) causes at least one 
of the following effects: 

- draw the attention of an operator; 

_ place an index mark at the appropriate place in recorded video data; and 

- trigger selective recording of video data. 

20 3 A method according to claim 1 or claim 2, wherein the learning process (3) 
accumulates data representing the behaviour of the track(s) over a period of 
time in a four-dimensional histogram, said four dimensions representing x- 
position, y-position, x-velocity and y-velocity, of the track(s) within the video 
image. 

25 4 A method according to claim 3 wherein the leam behaviour stage segregates the 
tracks according to a velocity threshold; wherein tracks moving at a velocity 
below the velocity threshold are considered stationary while tracks moving at a 
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velocity in excess of the velocity threshold are considered mobile; wherein data 
concerning the mobile tracks is stored in said four-dimensional histogram, data 
concerning the stationary tracks being stored in a two-dimensional histogram, 
said two dimensions representing x-position and y-position within the video 
5 image. 

5. A method according to either claim 3 or claim 4 wherein a cell size of the four- 
dimensional histogram varies in accordance with a measured speed in the image 
of each respective track. 

6. A method according to any of claims 3-5 wherein the histogram is periodically 
10 de-weighted in order to bias the result of the learning process (3) towards more 

recent events. 

7. A method according to any preceding claim wherein the comparison process (4) 
classifies a track according to a comparison of the frequency of occupation of 
the corresponding histogram cell with an occupancy threshold. 

15 8. A method according to claim 7 wherein the comparison process (4) acts to 
classify as normal behaviour a track adjacent or near a cell which is above the 
occupancy threshold, despite the track appeeuing in a cell below the occupancy 
threshold, where one cell is considered to be near another if the distance 
between them is below a predetermined distance threshold. 

20 9. A method according to any preceding claim wherein abnormal tracks are 
filtered, whereby an active alarm signal (20) is generated in response to an 
abnormal track which resembles a number of other abnormal tracks, in terms of 
at least one of position, velocity and time. 

10- A method according to any preceding claim wherein abnormal tracks are 
25 filtered, whereby an active alarm signal (20) is generated in response only to an 

abnormal track which has been classified as abnormal on a predetermined 
number of occasions. 
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11. A method according to any preceding claim wherein abnormal tracks are 
filtered, whereby an active alarm signal (20) is generated in response only to a 
track being classified as abnormal for the first time. 

12. A method according to any preceding claim wherein abnormal tracks are 
5 • filtered, whereby an active alarm signal (20) is generated only in response to a 

filtered version of the classification rising above a predetermined threshold 
value. 

13. A method according to any preceding claim wherein subsequent active alarm 
signals (20) are inhibited for a predetermined time interval after a first active 

10 alarm signal (20) has been produced. 

14. A method according to any preceding claim wherein subsequent active alarm 
signals (20) are inhibited if caused by an abnormal track within a predetermined 
distance of another track which has previously generated an alarm. 

15. Apparatus for processing video images to detect an event of interest, 
15 comprising: 

- a source of video images, producing a video signal (10) representing the 
video images to be processed; 

a feature extraction device (1) receiving the video signal and producing data 
(12) representing at least one point feature detected within the image; 

20 - a feature tracking device (2) receiving the data (12) representing point 

features and producing data (14) representing tracks, being representative of 
the position and speed of each respective point feature, within the image; 

- a learning device (3) receiving the data (14) representing the tracks and 
producing a signal (16) representing a range of behaviour considered normal 

25 by the learning device, in response to operation of a learning process on the 

data (14) representing the tracks; 

a classification device (4) receiving both the signal (16) representing the 
normal range of behaviour of the tracks and the data (14) representing the 
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tracks, being adapted to compare the signal (16) and the data (14) and to 
issue a normal/abnormal signal (18) in accordance with the outcome of such 
comparison; and 

an alarm generation device (5) receiving the normal/abnormal signal (18) 
5 and generating at least one active alarm signal (20) in response to the 

normal/abnormal signal indicating abnormal behaviour of at least one track. 

16 Apparatus substantially as described and/or as illustrated in the accompanying 
figure. 

17 A method substantially as described and/or as illustrated in the accompanying 
10 figure. 
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